Time-Scale Information Measures for Text-Independent Phone Segmentation

Authors

  • A. S. Cherniz
  • M. E. Torres
  • H. L. Rufiner

Abstract

In this work, a speech parameterization based on the continuous multiresolution divergence is used to modify a text-independent phone segmentation algorithm. This encoding serves as the input and also replaces the stage of the segmentation procedure that estimates the intensity of changes in signal features. The segmentation performance of the modified algorithm is compared with that of the original algorithm using as input both a classical Melbank parameterization and the speech representation based on the continuous multiresolution divergence. The results indicate that the proposed modification improves the ability of the algorithm to perform the segmentation task. This suggests that the continuous multiresolution divergence provides valuable information about acoustic features that reflect phoneme transitions. Moreover, this parameterization gives enough information for its direct use without further processing.

Keywords: Information measures, Divergence, Multiresolution analysis, Automatic speech segmentation.
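The abstract refers to a stage that estimates the intensity of changes in signal features. As a rough illustration of that general idea only, the sketch below computes a divergence between the distributions of adjacent frames and takes local maxima as candidate boundaries. It is not the continuous multiresolution divergence used by the authors: the amplitude histograms, the symmetric Kullback-Leibler divergence, the function names, and all parameter values are assumptions chosen for a self-contained example.

```python
import numpy as np

def frame_histograms(signal, frame_len=256, hop=128, bins=16):
    """Return one normalized amplitude histogram per overlapping frame.

    Assumption: amplitude histograms are a crude stand-in for a real
    time-scale (multiresolution) representation of the speech signal.
    """
    edges = np.linspace(signal.min(), signal.max(), bins + 1)
    hists = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        counts, _ = np.histogram(signal[start:start + frame_len], bins=edges)
        p = counts.astype(float) + 1e-3  # additive smoothing avoids log(0)
        hists.append(p / p.sum())
    return np.array(hists)

def symmetric_kl(p, q):
    """Symmetric Kullback-Leibler divergence: KL(p||q) + KL(q||p)."""
    return float(np.sum((p - q) * np.log(p / q)))

def change_intensity(hists):
    """Divergence between each frame and the next one."""
    return np.array([symmetric_kl(hists[i], hists[i + 1])
                     for i in range(len(hists) - 1)])

def pick_boundaries(intensity, threshold):
    """Indices of local maxima of the intensity curve above a threshold.

    Index i marks a candidate boundary between frame i and frame i + 1.
    """
    return [i for i in range(1, len(intensity) - 1)
            if intensity[i] > threshold
            and intensity[i] >= intensity[i - 1]
            and intensity[i] > intensity[i + 1]]
```

With a synthetic signal whose statistics change abruptly halfway through, the intensity curve should peak near the frames that straddle the change; the threshold is a free parameter that would need tuning on real speech.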

Similar resources

Phonemic segmentation using the generalised Gamma distribution and small sample Bayesian information criterion

In this work, we present a text-independent automatic phone segmentation algorithm based on the Bayesian Information Criterion. Speech segmentation at a phone level imposes high resolution requirements in the short-time analysis of the audio signal; otherwise the limited information available in such a small scale would be too restrictive for an efficient characterisation of the signal. In orde...

文本不特定之自動音素分段演算法 (Text-Independent Automatic Phone Segmentation Algorithm) [In Chinese]

This paper proposes a text-independent sequential phone boundary detection algorithm. Without any previous knowledge, an automatic phone segmentation system is constructed. The method sequentially searches for a candidate phone boundary and follows it with a verification process. The phone segmentation is accomplished when the phone boundaries are verified. The discrete wavelet transform is appli...

Confidence measures for phonetic segmentation of continuous speech

In the context of text-to-speech synthesis, this contribution deals with the segmentation of speech into phone units. Using an HMM based segmentation system, we proceed to compare several phone-level confidence measures to detect potential local mismatches between the phone labels and the acoustics. As well as serving this purpose, these confidence measures will help the system suggest a new lo...

Identifying unexpected words using in-context and out-of-context phoneme posteriors

The paper proposes and discusses a machine approach for identification of unexpected (zero or low probability) words. The approach is based on use of two parallel recognition channels, one channel employing sensory information from the speech signal together with a prior context information provided by the pronunciation dictionary and grammatical constraints, to estimate ‘in-context’ posterior ...

A Fast Algorithm for Korean Text Extraction and Segmentation from Subway Signboard Images Utilizing Smartphone Sensors

We present a fast algorithm for Korean text extraction and segmentation from subway signboards using smart phone sensors in order to minimize computational time and memory usage. The algorithm can be used as preprocessing steps for optical character recognition (OCR): binarization, text location, and segmentation. An image of a signboard captured by smart phone camera while holding smart phone ...

Publication date: 2009